37 - NHR PerfLab Seminar 2022-06-09: LARC: A Case Study in Enhancing CPUs with Copious 3D-Stacked Cache [ID:43836]
50 von 551 angezeigt

First, I want to introduce a little bit about the Rican background since some of you may

not know Rican.

I know that there are a bunch of students in the audience.

So just to give you a little bit of an overview, Rican was founded quite long ago, over 100

years ago.

It is one of the biggest research institutes in Japan.

It currently has 10 different campuses.

We are right here, one of the campuses in Kobe, which hosts the supercomputer Fugaku.

But the main office is near Tokyo in Wako.

We have over 3,000 researchers and administrators.

So quite a big research institute, quite similar to one of the DOE labs in general, Max Planck

from Germany for example.

So there's a bunch of stuff.

So we don't do only computer science.

There's biology, chemistry, physics.

We have Spring 8 for example, which is a cyclotron accelerator to do stuff like DESI in Germany

for example and other things.

And at RCCS, we basically do computer science.

So the science of computing, by computing and for computing.

So we have various research teams.

My team is only one of that, which was founded quite recently.

So we have computer science teams, we have computational science teams, which take care

of disaster mitigation, fluid dynamics and so on.

We have drug discovery teams, AI teams, and some of the teams are mainly for supercomputer

Fugaku for management operation stuff.

So there are a substantial amount of teams and researchers inside of RCCS.

And you may know a few of the recent achievements.

So Gordon Bell Prize was awarded to Rican for the fight against COVID-19.

And then we have data assimilation stuff going on where we combine real-time inputs into

climate simulations and disaster mitigation simulations.

And Fugaku also won and is currently still number one for the HPCG and the Graph 500

list.

But in the beginning, it was number one on all four titles.

So top 500, Graph, HPCG and HPL AI.

So that changed recently with the start of Frontier.

So let's look at the outline.

So after going a little bit over our center, let's look at the outline for the large cache

study we have done recently.

The motivation why we want to look at this.

Then I will introduce our hypothetical log processor and then go a little bit over the

different evaluation strategies we have developed.

And we are trying to basically see what it would bring if we would increase the amount

of cache and maybe reduce the latency as well.

And then a little bit of an outlook.

So why is biologic cache interesting?

Basically at the moment, we are a little bit at the end of a more era.

And so there are different potential paths we can go.

We can go quantum, neuromorphic.

People are looking into reconfigurable computing like CGIs, FPGAs, different pathways, how

we can combat the lack of transistor shrinking in the future.

Teil einer Videoserie :
Teil eines Kapitels:
NHR@FAU PerfLab Seminar

Zugänglich über

Offener Zugang

Dauer

01:04:07 Min

Aufnahmedatum

2022-09-06

Hochgeladen am

2022-09-13 17:06:03

Sprache

en-US

Speaker: Dr. Jens Domke, RIKEN Center for Computational Science (R-CCS), Kobe, Japan
Title: LARC: A Case Study in Enhancing CPUs with Copious 3D-Stacked Cache
Abstract: Over the last three decades, innovations in the memory subsystem were primarily targeted at overcoming the data movement bottleneck. In this talk, we focus on a specific market trend in memory technology: 3D-stacked memory and caches. We investigate the impact of extending the on-chip memory capabilities in future HPC-focused processors, particularly by 3D-stacked SRAM. First, we propose a method oblivious to the memory subsystem to gauge the upper-bound in performance improvements when data movement costs are eliminated. Then, using the gem5 simulator, we model two variants of LARC, a processor fabricated in 1.5 nm and enriched with high-capacity 3D-stacked cache. With a volume of experiments involving a board set of proxy-applications and benchmarks, we aim to reveal where HPC CPU performance could be circa 2028.
Speaker bio: Jens Domke is the Team Leader of the Supercomputing Performance Research Team at the RIKEN Center for Computational Science (R-CCS), Japan. He received his doctoral degree from the Technische Universität Dresden, Germany, in 2017 for his work on HPC routing algorithms and interconnects. Jens started his career in HPC in 2008, after he and a team of five students of the TU Dresden and Indiana University, won the Student Cluster Competition at SC08. Since then, he published dozens of peer-reviewed journal and conference articles. Jens contributed the DFSSSP and Nue routing algorithms to the subnet manager of InfiniBand, and built the first large-scale HyperX prototype at the Tokyo Institute of Technology. His research interests include system co-design, performance evaluation, extrapolation, and modelling, interconnect networks, and optimization of parallel applications and architectures.
Einbetten
Wordpress FAU Plugin
iFrame
Teilen